Picture for Martha White

Martha White

Fine-Tuning without Performance Degradation

Add code
May 01, 2025
Viaarxiv icon

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

Add code
Nov 22, 2024
Viaarxiv icon

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Add code
Sep 02, 2024
Figure 1 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 2 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 3 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Figure 4 for Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
Viaarxiv icon

q-exponential family for policy optimization

Add code
Aug 14, 2024
Viaarxiv icon

The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning

Add code
Jul 26, 2024
Viaarxiv icon

Investigating the Interplay of Prioritized Replay and Generalization

Add code
Jul 12, 2024
Figure 1 for Investigating the Interplay of Prioritized Replay and Generalization
Figure 2 for Investigating the Interplay of Prioritized Replay and Generalization
Figure 3 for Investigating the Interplay of Prioritized Replay and Generalization
Figure 4 for Investigating the Interplay of Prioritized Replay and Generalization
Viaarxiv icon

Position: Benchmarking is Limited in Reinforcement Learning Research

Add code
Jun 23, 2024
Viaarxiv icon

Demystifying the Recency Heuristic in Temporal-Difference Learning

Add code
Jun 18, 2024
Figure 1 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 2 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 3 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 4 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Viaarxiv icon

A New View on Planning in Online Reinforcement Learning

Add code
Jun 03, 2024
Figure 1 for A New View on Planning in Online Reinforcement Learning
Figure 2 for A New View on Planning in Online Reinforcement Learning
Figure 3 for A New View on Planning in Online Reinforcement Learning
Figure 4 for A New View on Planning in Online Reinforcement Learning
Viaarxiv icon

Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL

Add code
Apr 02, 2024
Figure 1 for Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL
Figure 2 for Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL
Figure 3 for Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL
Figure 4 for Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL
Viaarxiv icon